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ABSTRACT 

In   this  paper  we    consider  a   fundamental  problem  of  reaouroe   allooation  in  a  distributed  net- 
work.     Each   user's    demands    for   resources   may   ch4mge    dynamically   and   the   proo«»»ors   speeds    can   vary 
dynamically.      Let      v     be    the   maximum   number   of    users    competing    for    a  particular    resource    at    any 
time    instant.      I>et      k      be    the   maximum   number   of    resources    that    a   user   is   willing   to   get,    at    any 
time    instant.      This   problem  was   previously    formulated   in    [Lynch,    1980].       It   has    application    (1)    to 
two-phase    locking    in    databases       (2)    to  generalized  dining   philo»opherE  ,    and      (3)    to   the    implemen- 
tation  of    a  novel   extension   of   the   CSP    language,    called   Social-CSP    and   to  niany   other   applications 
to   concurrent  programming. 

Informally   we   say    that   an   algorit:hm   for   this   problem   is   real    time    if    its    response    time    is 
upper  bounded  by    a    function  which    does   not   depend  on   any   global   measure   of    the    system   of   processes 
and   resources,    except      k      and     v.       [Reif,    Spirakis  ,    1982bl    gave   the    first   known    real    time    algo- 
rithms  to   the   problem,    with    (mean)    response    time   of      O(v^).      This    response    time   may   be    too    lona , 
however,    in   applications   where      k      has    a   large   value. 

In   this   paper   we   provide   new   algorithms   whose    response    time    is   polunorrriaZ    to      v      and      k.      cXir 
Bost   efficient   new   algorithm   has    expected   response    time      0(vk).      Moreover,    our   constant    factors 
appear  to  be   small   enough   for  practical   applications. 

Unlike   our  previous   probabilistic   algorithms   of    [Reif,    Spirakis,    19e2b] ,   we    do   net    use    random 
delays    as   means   of    avoiding  process    starvation   and  of    achieving  probabilistic   fairness.       Instead, 
our   new   algorithm    utilizes      a     method  of  probabilistic  bidding,    to   resolve    contention   of    users    for 
resources.      This    technique    is   essential    in   our   achievement   of   polynomial    response    time. 

We    furthermore   prove    that   our    solution    is   optimal   with    respect    to   the    average    response    to   a 
user's    request.       In   particular  we   provide   matching    lover  bounds    for    any    distributed   algorithir.    for 
resource   allocation,    and  these  bounds   are  within   a   constant   factor   of   the    response   time   of   our  own 
algorithms. 

We    also  provide   a   suboptimal   algorithm  which    is   useful    for  inproving   the   throughput   rate   of 
resource    allocation   in   systems   where    the   majority   of   users    is   willing    to   get   only    a   few   resources. 
This    suboptimal    algorithm   does    use    reuidom  waits    combined  with   probabilistic  bidding.      These    tech- 
niques  enploy    limited  parallelism  within   each   process,    together  with    the   probabilistic   3i(_din9. 
(This    limited  parallelism  is    useful    in   achieving   optimal    response    time,    t:hough  we   wculd   still   get 
polynoc-ial   response   time  without   limited  parallelism.) 

1.    INTRODUCTION 

'•'      Resource   Granting   Systems.      In   this   paper,   we    consider   a   resource    allocation  problem  pre- 
viously   described   in    [Lynch,    1980]    and  generalized   in    [Reif,    Spirakis,    1982bl .      The   system   has 
potentially   an   infinite    set   of   processes      Tt      and   each  process   has    an   integer  name.      There    is    a 
potentially   infinite   set   of    resources      p      in   the   system.      Let      R  c  tt      be   the   set  of   processes   which 
control   resources.      The   set   of  user's  processes   is      U-tt  -r.      Each    resource      p(j)   €p      is    controlled 
by   a   distinct   granting  process      j.      This  process   has   the    responsibility   of  granting   each    resource 
to   a   distinct   user   process   in      U ,    so   no   two   different  processes   in      U      may   use   the    same    resource 
at   the   same   time.      We   assume    that   there   is    a  global   time  which   totally   orders   events,    but   pro- 
cesses  may   not  have   access   to  the  global   time.      User  processes   comnninicate   only  witji   those   granting 
processes    for  which   they   request   resources. 

A   system  as    above   is    called   a  Resourae   Granting  System    (hgs)  ,-    see    [Reif,   Spirakis,    1982bl  . 
The    goal   of    an    RGS    is    to   satisfy    dynamically    changing    user    requests    for   resource    allocation,    in    a 
distributed  way,    by   only    a   local    communication   between   granting   and    requesting  processes. 
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The  let  of  possible  schedules  of  processes  actions  is  determined  by  an  "adversary"  oracle  »V 
which  has  the  power  to  set  actions  in  the  worst  possible  way  to  increase  the  respxanse  time,  jj   has 
also  the  capability  to  select  at  time   t  -0   the  schedule  of  speeds  of  all  processes  at  all  times 
t  >0.   In  addition,  at  each  time   t  >0,  the  requests  by  user  processes   U   are  specified  by  jU. 
The  adverse  oracle  ^    is  restricted,  to  allow  users  to  keep  asking  for  their  resources  until  they 
are  granted.   In  practice,  no  such  .V  may  exist  but  coincidence  of  worst  case  situations  may  re- 
place its  action. 

Let  a  process  be  tame   during  a  time  interval   A,  if  for  any  interval  A'   whidi  intersects   A 
and  is  a  single  step  of  the  process,  then   |A'|  ^  ^'^'^in'^'max^   where   J^jj^^'  '^•ax  *"  fixed  real 

constants  with  0<r    <r    .   We  do  not  require  processes  to  be  tame  at  all  times.   However,  our 
mm   max 

pj-Qof  that  our  techniques  are  real  time  makes  use  of  the  assun^ition  that  processes  are  tame . 
Finally,  we  assume  that  processes  are  reliable   in  the  sense  that  they  perfectly  execute  their  pro- 
grams.  See  Sections  1.2  and  1.3  for  further  details  of  this  model  and  for  precise  definition  of 
response  time.   (Sections  1.2  and  1.3  may  be  skipped  in  a  first  reading  of  the  paper.) 

1.2   The  RGS  Model .   A  process  8tep   consists  of  either  an  assignment  of  a  variable,  a  test,  a  log- 
ical or  arithmetic  operator  or  a  no-op.   A  step  is  considered  to  be  a  finite  time  interval   A  m 
which  a  single  primitive  instruction  is  executed  just  at  the  last  moment  of  A,  and  no  other  in- 
structions are  executed  within   A.   Let  a  process  b«  tame   during  a  time  interval  A,  if  for  any 


be  tame  at  ail  times.   However  our  proof  that  our  algorithms  are  real  time  makes  an  assumption  that 
processes  are  tame.   We  assume  that  processes  are  reliable   in  the  sense  that  they  perfectly  execute 
their  programs.   The  rate  of  execution  varies  dynamically.   We  require  that,  at  no  time,  any 
granting  process   i  €R  simultaneously  grant  the  resource  p(i)   to  more  than  one  requesting  pro- 
cess.  We  also  require  that,  as  soon  as  a  process   j  €U   has  got  all  its  required  resources,  then 
it  can  keep  them  only  for  a  time  interval  whose  length  is  upper  bounded  by  a  fixed  parameter   6 
(containing  at  most  u   =  6/r  ,    steps,  if  the  process  is  tame).   Let   resource^(i)   be  the  set  of 
resources  that  process   i   is  requesting  at  rime  instant   t.   Let   )t   ^   be   |  resources^  (i )!  .   Let 
askers  (j)   be  the  set  of  user  processes  requesting   p(j)   for   j  €R   at  time   t.   Let   v     be 
[askers  (t)  i  .   We  assume  that  at  all  times   t>0,  v.     and   k.     are  upper  bounded  by  constants 
v,k   respectively  and  that   k  <v.   (This  does  not  necessarily  in^ily  any  bounds  on   1^   q  resour- 
ces (i)  |   or   |U  Q  askers  (j)|   for  any   i   in  n  or  j   in   R. 

With  respect  to  interprocess  communication  we  assume  (1)  that  each  resource  allocator   j  CP 
has  a  set  S   available  to  it  of  size  at  most   v,  containing  the  names  of  those  processes  willing 
to  get  the  resource.   As  in  (Lynch,  19801  we  assume  this  to  be  a  primitive  of  our  system  (it  could 
be  in?>lemented  by  a  queued  message  system,  for  example)   (2)  that  synchronization  must  be  done  by 
special  variables,  called  flags,    each  of  which  is  written  only  by  one  process  and  read  by  at  most 
one  other  process.   Read-write  conflicts  on  flags  are  excluded  by  our  process  step  semantics  and 
by  our  notion  of  global  time.   Flags  seem  to  be  the  simplest  primitives  for  synchronization  and 
lead  to  an  easy  in^Jlementation  (in  contrast  to  distributed  shared  variables  of  multiple  readers  and 
writers) . 

1-3   Implementations  and  Comploity  of  an  RGS.   An  implementaticm   of  an  RGS  determines  the  synchro- 
niration  algorithms  that  the  processes  run.   As  stated  above,  the  synchronization  algorithms  use 
only  flags  to  implement  the  synchronization  between  processes.   We  consider  a  time-varying  hyper- 
graph   Hj.   with  node  set   tt   and  time-varying  hyperedge  set   E^  -  { {  i  )  Uresource^  (i  )  1 1  €  U  }  ,  i.e.. 


where   a  hyperedge   at   time   instant      t      is   the   aet   of  nodes   of      n     consisting  of   a  single   process     i 
in     U      and   the   granting  processes   of   the    resources      i      is   willing   to   get    at      t.      An   RGS    iB^ile- 
■entation   dynamically    achieves   distributed   natchings    in   the   hypergraph      H    . 

For   each   adverse   oracle    u^,    let   the   reeparxBe    time    of    the    RZ   tj^3leinentatj.on   by    the    random 
variable      Y    ^  ^      which   is    the    length   of   the    smallest    interval      A     required   for   any   process      i  6u 
to  have      k      resource    request    simultaneously   granted,    given   that      i      requested   tbasc    resources 
during   the   entire   interval      A     and  assuming  that,    i      and  all   allocators   of  thm   rssource    requested 
by     i     within     d,   are   tame  within     t. 

Let   the  mean  reepanse     y^     be   the     max{mean{Y         }      over   all  oracles      a).      I<*t   the    C-reeptynae 
be   the   minimum     "^i.'^'       such   that    for   every   oracle    ^       probly      w^'Vu'^'^  ^^  "^-      "^^    ""^^    imple- 
mentation  is   real    time    if      Vc  €  (0 ,1]  ,Y|^  (E)    >0      and   upper   bounded  by   a    function   independent    of    any 
global  meoBure  of   the   network.       (Note:      A  global  measure  of   the  network   is    any  positive   function 
g      of     h  "  |t|      such   that      linL^  -   0).      Hence   if    an   PCS    iB^>le»entation   is    real   time,    then   the 

Bean   response      y        is    also   upper  bounded  by    a   function   independent   of    any   global  measure   of    the 
network. 

\.l*      Previous   Work.       [Rabin,    19B0a]    first    applied  probabilistic  choice    to   synchronization   problems 
in   distributed  systems    and  provided   a   solution   to  the   dining  philosophers  problem  which,   with 
probability   1,    is   deadlock    free   and   starvation    free.       [Rabin,    1980b)    applied  probabilistic   coordi- 
nation methods   to  synchronize   access   to  processes   to   a   critical  resource   in   a  apace-efficient  mannej 
[Frances   and  Rodeh,    1980]    and    [Itai    and   kodeh ,    1981]    also  proposed  probabilistic  techniques    for 
synchronization   and   leader  election  problems,    respectively. 

[Lynch,    1980]    first  posed  the    localised  resource   allocation  problem  as   a   formal   synchroniza- 
tion problem.      Let   the   resource   graph     G     be   the   graph  whose   nodes   axe   the    resources   and   two   re- 
sources  are    connected  by   an   edge   if   there   is   ever   a  user  process   requesting  both  of   them,   maybe   at 
different   times.      Let      X(G)      be   the    chromatic  number   of     G.      The  in^jlementation  proposed  by    Lynch 

was   a  deterministic  one   in  which  processes   should  know   the   color  of  each   resource    in   a   coloration 

y  (G) 
of      G.      The    response   time    achieved  in    [Lynch,    1960]    is    of   the   order   of      )((G)v*         •!      where      T      is 

the   time  necessary   for   interprocess   communication.      This  was   not   a  real  time   in^jlementacion   since 

X(G)      is     n(|Ti|)      in  general. 

[Reif,    Spirakis,    1982b]    provided   the    first    real   time    RGS   ijiplementation ,    with   mean    response 

time     0(kv        log   v)  .      In  that  previous  work,   we   used  the    techniqx»es   of  probabilistic   selection   of 

processes  by   resource   allocators    and   random  waits   to   avoid  adver»e   schedules   of  speeds  which  might 

be   set   \ip  by   the   oracle.      Although   this   was    a   real   time   in^ilementation ,    it  was   still   exponential 

in     k. 

1.5     The   New   Results    of   This    Paper.      We   shall   present    (in   Section   3)    a  probabilistic   implementation 
of   an   RGS,   with  mean   response      C(kv).      To  achieve   this    response,   we  maXe   essential   use   of   the 
probabilistic  bidding   technique,    together  with   use   of   limited  parallelism  within   each   user  process 
and  each   resource   allocator.      In  our  uniform  bidding  algorithm,  we   do  not  use   randoB  waits   to 
achieve  probabilistic  fairness.      Instead  we   use   only  the  probabilistic  bidding  technique.      In 
particular,   we   slice   the  time   of   each  process   into   rounds.      In  each   round  each   user  process   tries 
to  get   all   the   wanted  resources.      It   has   to  get   all   of  them  in  the   same    round.    -The   users    deny   the 
allocation  of   resources   to  them,    unless   all   the    required  resources    are   offered  to  be   allocated  in   a 
•mall   number  of   steps.      At   the   end  of   the    round,    users    release  their   allocated  resources    (if   any) 
and  make   a   fresh   start.      User   rounds   have   the   same    length   in  steps    for   all   users    and   this    length   is 
a  parar^ter  of   the   algorithm.      In   contrast,    resource    rounds   are  not   of   the   same    length   and  their 


4 
length   in   steps    i«   not   fixed   in   »<Jv«nce ,   but   adjusts   to   the    conditions   of   the    algorithm.      We    con- 
jecture  that   this    is   essential   in   avoiding   exponential   growth  of  the    response  with     K. 

We   also  prove   lower  bounds   of     HOcv)      for  the  worst   case   and  average    response   time   of   any 
algorithm   for   the    local   resource   allocation  problem.      Thus    our  proposed   technique    is    of   optimal 
performance   within   a  constant    factor.      We   also  provide    a  prioriU/  bidding  algorithm  which   has   mean 
response   tine  polynomial   in     X      (however   not  optimal )    and  is   useful   for  is^rorlng  the    throughput 
rate   of   resource   allocations   in   the   network.      In  particular,    it   allows   user  pzooasses   which   demand 
less   than     k      resources   to  have   higher  probability   of  being   assigned.      This   algorithm  has   the 
property   that   if   a  user     i  €u     has    a   request   of   et   most     k.   <k      resources,    thmn  it  has   mean   response 
0(vk      log   k    log(k^v) ) . 

l.t     App  I  i  cat  i  ons  .      Exjunptc    ?  •      SoCA.CLt  CSP .      An  extension   of   CSP,    defined  and  discussed  in 
IFrancez,    Reif,    1984),   has    an   efficient   in^jlMoentation   by   our  real   time   KHS.      Social-CSP  has    the 
following   new   cocmands : 

(1)  Extended  Output   Cormtand:       (p^    , .  . .  ,pj    )  1  (u    , —  ,u^)      in  which   the   sender  process   simulta- 
neously   sends    the   value      u^      to  proces      P j j .      t-l,...,k.      Here,    'simultaneously"   means    that    the 
receipt   of    a   value   by    a   process   named   in   the    output   command   does   not    affect    in    any  way    the    receipt 
of   the   values   by    other  processes   named   in   the    output    conmiind.      Note    that    (1)    can  be    considered   as 
the   generalization   of    a  broadcast    command. 

(2)  Extended  Input   Carvvand:       (p^^    , .  .  .  ,pj^    )  ?  (x    , .  ....,x^)      where   the    receiver  process   simulta- 
neously  gets   a  value    for   its   variable      Xj      from  process     p^    ,      t»l,...,k. 

Although   these   extended  input   and  output   coomvands    can,    in   theory,    be   simulated   in   Hoare's   CSP, 
it  is   not   clear  how  to  provide    an   efficient   simulation.      The   pot^r  of   the   new   constructs   of   Social- 
CSP   can  be   demonstrated  by   the   sin^jlicity   they   give   to  a  program  solving   the   k-fork  philosophers 
problem.       In    contrast,    it    is    not   known   liow   to   solve    the   k-fork  philosophers   problem  by    the    conven- 
tional  CSP    constructs.       (See    also   our    Example    2,    below.) 

Social-CSP    commands    caxi   be    directly    inplemented  by   our   RGS    real    time    in^slementation ,    by    con- 
sidering   the    sender   in    the    output    command    (respectively    the    receiver    in   the   input    command)    as    a 
user  process    and   the   processes      p,     ,  .  .  .  ,p.:  (respectively      Pi,  >  •  •  •  >Pj^.  )       as    resource    granting 

processes.      Note    that    our    iCTplementation   of    Social-CSP    allows    for   unspecified   or    computed   targets 
of    coonnuni cation ,    since    the    identities    of    the    resources    a    user   wants    may    change    dynamically. 
(This    IS    useful    in   case    of    routing   protocols    and  was    first    considered   in    (Francez,    1962J.) 

EmrnpAc    2  ■•      k- j^onk    Ph-U.O&ophz'V, .      As    a    simple   example   of    the    usefulness    of    RGS,    consider   a   general- 


ization  of    the    dining   philosophers   problem   to   the    case   where    each  philosopher    requires    k-forks    to 
eat.       (This   problem  was    first    considered    in    [Lynch,    1980)).      We   extend   it    to   the    case   where    the 
identities    of    the    forks    required    by    each   philosopher    change    dynamically.       Let    the    set    of    "forks"    be 

R  -    ir^ r^)      and   the    set    of    "philosophers"    be      U   -    lu    u^)      and    let      resources^  (u^ )    = 

^^i'^i^Dmod  n ^l*k-l)mod   n'      '^^     askers^(r.)    -    {u,,_^^^j^^  ^ "(i-l)mod   n'"i^      '°^    ^''    '• 

Our    new   resource    allocation    algorithm   achieves   mean    response    time      0(k    ).      In   contrast,    our  previous 
results    achieved   mean    response    time      0(k*''*'"'),     (see     iReif,    Spirakis  ,    1982bl  )  . 

Example    3:      Tu)o-Phcu,e    locfcx-ng    -en    Da.tabaj>ti  .       Two-phase    locking    is    a    concurrency    control    method   m 
databases;    for    a   survey    see    iBemstein,    Goodman,    1980).       It    has    the    feature    that    as    soon    as    a    trans- 
action   releases   a    lock,    it    never   obtains    additional    locks.      A   very    efficient    static   implementation 
of    two-phase    locking    can   be    achieved  by   our  methods.      Our    assumption    is    that    transactions    are 


allowed   to   act   on   the    data  only   if    they   got    all    the    locks    requested.       In   the    context   of    such    a 
database   system,    let   the   users   in     U     be    called   tranaaction  modulea  and   the  processes   of      R     be 
called  data  modulee.      If   the   readsets   of   the   transactions   are   of   cardinality   at  most      k      at   each 
tiBe   instant    and  if    at   most      v      transactions    can    coopete   for   a   lock   at   a   t±mm   li^tant      t,    then 
our  optimal    RGS  will    result   in   an      0(vk)      nean   response   txme   per   transaction.      Our   suJBoptimal    RGS 
achieves    an   even   smaller  mean   response   time   when      jreadset    (t) |   -0(k).      In   this    case,    if    a   trans- 
action wants   to   lock      k.       data   items    at   a   tiae ,    it   has   a  mean   response      0(vlc.    log (vk) log    k)  . 
(However,    this   becomes     0(vk   log(vk)loq   k)      when      |  readset .  (t)  |    -   k)  .      Our  ii^lei»entations   of   two 
phase   locking  proposed  in   this  paper   are   acyc^totically  more   efficient   than  tha   static   locking 
method  proposed   in    (Reif,    Spirakis ,    1982b],    whidi   had  a  mean   response      0(kv        ).      This    our   new 
algorithm  becomes   advantageous   in   cases   of   database   systems  with   small   granularity   of    locking    and 
hence   very    large   cardinality   of   transaction   readsets.      In  those   cases   other  kjio%*n   algorithms   are 
iapractical    since    they   have   response    time   exponential   in     k. 

2.    AN      n(kv)       LOWER   BOUND   FOR  THE    LOCAL    RESOURCE    ALLOCATION    PROBLEM 

THEOREM  1.  For  k  ■:  ccnd  v  >k,  there  is  a  network  in  which  at  least  one  user  proaees  has  to 
have  a  response   time   of  at    least      (kv-l)u     Btevs. 

Proof.      Consider   a   network   with    a   set   of   resources      R      such   that      |r|   "k      and   a   set   of   users      D 
such  that      |u|   -kv    .      Let      ^    he   an  oracle   such   that  all  processes   are   equispeed, 
synchronous   and   such   that      Vt>0,      V^  £    U,       | resources    (j ) |   >(k/2)    +1.      Then,    only    one    user  pro- 
cess   can  be   granted   all    its    resources    at   eSch   time   instant      t.      To   see    this,    assume    for   saXe    of 
contradiction  that   at   least  two  users      JiOn      *^®  granted  all  their  resoiirces   at  time      t.      Each 
resource   has   to  be  granted  to  only   one   user   at   a  time ,   thus   f oming   a  bipartite  matching   of   the 
hypergraph      H^,    as   defined  previously   in   Section   1.3.        Hence  we   conclude   that   the   number   of 
allocated   resources    at   time      t      is 

Iresources    (j.) I    +    | resources^ (j^) |    >k+2>k  for  all        j  >0      . 

This   implies   that   resources  will  be   allocated  to  processors   serially,    hence   the    last  process   of   the 
serial   order  will   have   a   response   time   of   at    least      Ckv-l)p      steps.      The   above   holds    independently 
of   the    synchronization   technique.  ° 

COROLLARY.      Our  probabilistic  bidding  algorithm  of  Section  2  haa  optimal  mean  response  within  a 
constant  factor. 

Proof.  By  Theorem  1  and  by  the  fact  that  given  any  multiset  of  serial  orders  of  kv  elements, 
there  is  at  least  one  element  whose  average  position  (over  the  multiset  of  orders)  is  at  least 
l)cv/2j.  ° 

3.    OUR   DISTRIBUTED   UNIFORM   PROBABILISTIC   BIDDING   ALCOr.lTHM 
We   assume   that   the    requesting  processes   communicate   only   to  the   resouros   allocators  whose 
resources   they  want    (or   have   been   allocated) ,    and  that   each   granting  process      j      is  willing   to 
communicate   only   to   the    requesting  processes   in   the   set      S.       (as    defined  in  Section   1.2).      The 
actions   of   the    requesting   and  granting  processes    are   time-sliced   in   rounds,    each   round   being    a 
repetition   of    a  basic   set   of   actions.      Processes   use    independent   sequences   of   probabilistic    choices 
as   the   basic  construct   to   counteract   adverse   speed   schedules   and  adverse    resource   demands    set   up 
by   the   oracle      V.      We    assume    that     .jJ    cannot   affect   or   foresee   the    results   of   these   probabilistic 
choices.      We   allow  each   user   in     V      and  each    resource   allocator  in      R      to  have   a   set   of   synchronous 


parallel   subprocesaes ,   which   aid  in  our   algorithas.      The   use   of   local  parallelism  here   is   not 
actually   essential   in  achievement  of  polynomial    response   tiine. 

3    )      An    Informal    Description  of   the   Rounds. 

,.      The    U&l'i'i    Hound.      A  user's   round  starts  with   the   user  drawing    (with  aqual  probability)    a 

random  nuB*>«r   in   the   set      (1,2 6)cv}      where      6  >1      is   an  integer.      If  th«   nuaber   drawn  was    less 

than      SXv,    the   user   remains   nonactive,    until   the   end  of  the   rounds.       (All  uaars'    rounds   talce   a 
predetermined  nuiii>er  of   steps.)      Else,    the   user   iasnediately  notifies    (by  tb*  as*   of   at  most      k. 
parallel   synchronous   subprocesses)    all   the   resource   allocators  of   the    resources   he  wants,    that  he 
is    a  winner.      Then,    the   user's   parallel   subprocesses   collect   answers   froo  the   resources    for   a 
p^j-iod  which   is   bounded  by   a   const^mt   number  of  steps.      During  this   period  sane   of   the    resources   may 
declare   that   they   agree   to  be   allocated  to  the  particular  user.      However,    if   at  that   time,    any 
otiiez   resource   requested  by   that  user   is   denied,    then   that  user  does   not   utilize   the    resources  which 
agreed  to  be   allocated  to  him,    but   he   continues   to  report   that  he   is   a  winner   to   all   of  his    re- 
quested resources   and  repeats   the   algorithm    (without   drawing   again) ,    until  the   user's    round  ends. 
If   all   of   the  wanted  resources    agree   to  be   allocated  at  the   saae  period    (in  which   the   user   collects 
answers),    then   the   user   utilizes   them  for      U      steps    (U      is   a   small   integer   constant,    as   in   Section 
2.1)    and   then  he    relea  these   resources.      This    is   done   in  parallel,   by  explicitly   notifying   the 

granting  processes   of   the    release,    using   his      )c      subprocesses.      Note   that  a   ccmnuni cation  with   all 
the     K      resource .  allocators   takes   only      r  time   due   to  the   limited  parallelism  and  tameness   of 

processes. 

b.  Tht   RuouAcz   KtLocaXoi'i    Round.      The   round  of  resource  allocator     j      starts  with  a 
mcmitonng  period  of   a   constant   number  of   steps   during  which   at  most     v     parallel   synchronous   sub- 
processes    continuously   monitor   the    users   of   the   set      £ 
of  winners   detected  during    the   monitoring   period.      If 
the   elements   of 

However,    if      M        has    a   unique   winner,    then   the   granting  process  notifies    the  winner   that    it    agrees 
to  be   allocated.      If   the   winner   does   not   accept   the   agreement  then   the   rounds   ends.      If   the  winner 
accepts,    then  the   round  enters   an  allocaticm  period.      During   this   period,    the   parallel   subprocesses 
of   the    resource    allocator   deny    all   appearing   winners.      The    round  now   ends   by    receipt   of      the   noti- 
fication  by   the    user   that   the    resource   has   been    released. 

c.  Addct^oncLl   Ranaiiii .      Note   that    coimnunication  with   all     v      of   the   user  processes    and  all   set 
operations   in   a   resource    allocator's    round  taJ;e   only   a   constant    (independent   of      v      and     k.)    number 
of   steps    due   to  the  parallelism  employed.      Note   also   that   the   following  holds  with   certainty: 

A  resource   decides    to   be   allocated   to  a   unique  winner,    only   after  the    resource    allocator   agrees 
to  allocate   the    resource    and   the   winner   accepts   the    agreement.      Thus,    no   resource    can   be    allocated 
to  more   than  one   user   at   the   same   time,   by   our  bidding   algoritJam. 

3.2      A   Detailed    Description   of    the   Unifonn   Bidding   Algorithm. 

a.     VctOLittLd  VcianA.vtion  of,   Va.'Uablu   ard  Ccmtcuiti   Uizd.      in  the   following,  we   set 


Uminj 


*  min 

Th«  u»«r8  use  the  following  flags:   For  user   i,  the  flag 
willing  to  get  resource   p(j).   The  flag  A.^  -1   iff  user   i  aootpte   the  allocation  of  the  re- 
source  P(j).   Both  flags  are  0  else.   The  flag   N^    is  initially  0,  it  bcooMS  1  when  user   i 
reteaaee   resource   j. 

The  resource  allocators   j   use  the  following  flags:   E   -0   if  the  r»»ource  is  denied  ajid  1 
if   j   agrees  that  its  resource  is  allocated  to  i.   Each  allocator   j   hA«  al»o  a  ehared    (for  its 
parallel  subprocesses)  variable   M.   which  allows  concurrent  reads,  and,  in  case  of  multiple 
writes  of  the  same  value,  their  sum  modulo  3  is  recorded.   This  can  be  done  in  constant  (3  steps) 
parallel  time  by  using  the  concurrent  read-e;tciu3ive  write  model  and  a  summation  binary  tree  of 
depth  3.   M   IS  used  to  count  winners  during  the  monitoring  period. 

Each  user   i   uses  also  a  shared   (for  all  its  parallel  subprocesses)  variable   L^ .   It  allows 
concurrent  reads  and  concurrent  writes  of  the  same  value.   L^   is  used  to  identify  situations  in 
which  all  wanted  resources  have  been  proposed  to  be  allocated  to  user   i,  at  the  same  time. 

The  counters   counter  ,  counter.   count  steps  of  respectively   i,3   in  a  round.   Note  also  that 
every  time  a  user  (or  resource  allocator)   p   (1)  modifies  a  flag  and  then   (2)  reads  a  flag  of  a 
resource  allocator  (or  user)   q   to  see  its  answer,  we  allow  for  \^   -  (r„,yr^^  r,' "  ■^  ^^^^^   between 
the  two  actions  of   p   (these  steps  allow  tor  at  least  2  steps  of  process   q   so  that   q   can  read 
the  asking  flag  and  answer  bacX) . 

We  now  present  formally  the  rounds  of  a  user   i   and  a  resource  allocator   j.   Note  that,  in  the 
code  which  follows,  the  section  of  code  between  cobegin ,  and  coend  is  executed  (in  a  synchronous 
fashion)  by  all  the  parallel  subprocesses  of  the  process  to  which  the  cobegin-coend  bloc)c  belongs. 

b.  Tht   liie.n.'i>   Hcund   i^on.  Uact  -c.   (Initially  W-A..-N..-0  yj«l,...,)c   and  L  -I) 

start   round 

L.    *   1 

1 

choose      X      randomly   uniformly    from      {l  ,2  , .  .  .  ,BJcv} 

if      X  f'Blcv     then   do     c-1     no-c¥>s ;    go  to   finish 

repeat :      cobegin    ( cocment   in  parallel   for     j  « 1 k) 


coend 
if      L.   -1      AND      counter.    <  c-p      then 
begin 

cobegin    {canroent   All   resources    allocated} 

>nt    accept) 


else 


use   resource      P(j)       for      y      steps 

N       ♦!      {cocmnent    release    resource} 

coend 

N   .  ♦0;    A.  .  -0;    wait   until      counter,   -c;      go  to   finish 

ij  13  i  

end 

begin 

cobegin      A.  .   -^0    {comment    deny    allocation;    coend 

if      counter     <c  -U  -2      then   go   to   repeat   else   wait   until      counter^ 

end 


finish:   end  round 


c.     The.  Ti^ou^vct.  *ZLoaitoi'6   Kound  (,ok  KllocaXoi     i.      (initially     K.  -0,  E      -0,   a.  ,  -o) 
local   a 


■tart   round 


do  until    (counter,  -X        or     M.   >0) 


if     counter.   <A        then    (do  no-op  until      counter.  •^5) 


coend 
cobegin 


if      (M.  -0     or     «.  -2)      then 

begin  E  .  ♦0;   go  to  finish  end 
else 


{conment :   Resource  allocated.   Await  release  by  user} 

go  to  finish 
end 
end 
else  (repeat   E  .  -^0   until  resource  deallocated) 
end 
coend 
finish :   en(i  round 

3.3  PROPERTIES  OF  THE  UNIFORM  BICDING  ALGORITHM 
In  the  following  we  assume  all  processes  tame. 

PROPOSITION  1.  If  a  particular  user     i     ie  a  winner  (i.e.    eeleate     x  •  BkvJ  in  its   current  round, 
then  its     k  parallel  synchronous  Bubprocesees  vill,   at   least  onae  in  its   current  round,   report  at 
the  same   time   that     i     ie  a  thinner  when  all  requested  resources  are  in  a  monitoring  period. 

P£oof .   Assume  that  user   i   has  just  been  declared  a  winner  in  its  current  round  because  i  got 
as  an  outcome   x  -  Bkv   in  the  probabilistic  selection.   Within  one  of  its  stepa  (at  most 
'^max'^'^min   °^  ^  resource  allocator's  steps)  all  resource  allocators  are  notified.   If  some  resource 
allocator  is  in  an  allocating  period  at  the  time  of  notification,  then  it  is  going  to  enter  a 
monitoring  period  by  at  most  a  number  of  its  steps  equal  to  the  allocating  period.   If  a  resource 
allocator  was  in  a  monitoring  period  at  the  time  of  notification,  then  it  shall  continue  being  in 
such  a  period  (since  his  resource  cannot  be  allocated  to  another  winner,  due  to  Uie  presence  of 


winner     i).      So,   by   at  ■ost   a  number  of   steps  equal   to  an   allocating  period  irom  the   time      i      (je- 
cided   that     i    is    a  winner    (i.e.,   by     <    '^■^^'  '^majt'^'^min'         "^"P"   °'      ^'  '    *^^    ^''    "»«>"rces    are    going 
to  be   notified,    at   the   sane   time,   within   their  monitoring  period,    that      i      ia    a  winner.  a 

PROPOSITION  2.      Given   that  a  partiaulccr  user     i     ia  a  uimer  in  its   current  roiaid,   the  probability 
that     i   Btaya  a  unique  xjiimer  for  all   ita  wanted  raaourexe  during  tha  whoU  roimd,   ia    lauer  bounoea 
by  a  aonatant,   independent  of     k     or     v. 

Proof.      We   need  the    following   definition: 

DEFINITION.      Let   a   drau  by   a   user  be   a   random  independent   selection  of  one   of  tha   numbers 

{1,2 Bkv). 

First  we  prove   Lenina   1 

LEKttA   1.      During  a  round  of  user     i,   any   other  dietincrt  user     j,   carpeting  for  the  earv   reeource  can- 
not  drau  for  more   than     6  -   r       /r    .      +1     timee. 
max     min 

P  roof   of    Lemma    1 .      The   maximum  number   of   rounds    of   user      j,    overlapping  with   the    round   of   user      i,    is 

r        /r    .      ♦  1       (because   the   maximum   ratio   of   speeds    cannot  exceed      r      _/r  and      j      may   have    drawn 

max     min  max^    nu.n 

at   most   once   in  each   round)  .      The    "plus   1"    i.s   to  take   into  account   the   fact   that   a   rouno   of   user      ] 
may  partially   overlap   the   beginning    of    the    round  of   user      i.  D 

Now,   given  that   user      i      is   a  winner,    the  probability   that     i      remains   a  unique  winner   during   his 
current   round   is   equal   to  the   probability   that  none   of   the    coo5>eting   users   manages   to  be   a  winner 
within     i's      round.      The   number  of   the    con^jeting   users    is   at  most     kv      (at  most     v      con?)etitors  per 
each  of  the     k      resources   asked  by   user     i)    and  each   coopeting   user  c&n  draw   for  at  most      6      times 
within     i's      round    (by    Lemma   1).      The  probability   of  each   draw   failing  to  win   is      1   -    l/6kv,    hence 
the   probability   that      i      stays   a  unique   winner  during   his   current   round  is   at   least 

/  1    \"^  1 

(l-— )  >-         ,  Where      e.2.73...         . 

TWEOREK  2.      The  probability   that  a  user  ia   allocated  all  hie  wanted  reeourcea   in   its  current   round  ie 
upper  bounded  by     l/6kv     and  Uxjer  bounded  by     l/2e6):v,      e  -2.73  — 

Proof.      The   probability      £_(T    ,A)  ,    for   oracle    ^    and  history      F    ,    that   a   user      i      is 

allocated   all    its  wanted    resources    in   his    current   round  starting   at      t,    never   exceeds      l/6kv      due    to 

the    fact   that     prob{user   i    chooses   to  be   a  winner   in  his    current   round)   «    l/6kv.      Given   that   user     i 

(looses   to  be    a  winner,    if     i    remains   a  unique  winner   during   all  of  his    current   round,    i      is    going 

to  be    allocated   all   of   his    resources    (due    to  Proposition    1)    with   certainty.      Multiplying  probaiji lities 

given  by   Proposition   2,    we   get   that 

2e6kv 

THEOREM   3.        Our  uniform  bidding  algorithm  has   z-reepanse     0(kv   log(l/£))      and  mean  response     0(kv). 

Proof ■      Let      u      be   the    number   of    rounds    required   for   user      i      to  be   granted   all   its      k      resources    in 
some    round,    given   that      i      starts    requesting    them   at   time      t        and   also   assuming   any   histor>'   of    the 
system   up   to      t^      and   any    oracle    u/.       Let    round      i      start    at    time      t^      i  <n.      We   have   by   Baye's 
formula        Prob(u-m)    -(l-£(r       ,A))...(l-£(r  ,A))-£(r       ,A).      By   use   of   Theorem   2,    we    get 

1  m-1  m 
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/        1    \*"^   1 

Prob(u-m)    <     1   -  -j^gj^l  gj^ 


If      u(C)      i»   the    least   number   sucii   that     Prob{u  >u(E)}   <   t,    than 


u(c)   < 


^'^(^  -   2^7) 


Sinc«      log  II 

round   taXes      2+p      stops.      So,    it   is   enough   for  the    length  of   the   user's    round  to  be   equal   to 

c  -    (2(2+U)  +1)  (r        /r   ,    )    ,   by   the  proof  of  Proposition   1.      This   iaplies   that   the   duration  of   a 

max     mln 

user's   round  is    at   most      cr  and  so    (for     U      independent   of     )c,v)      Prob{Y  ^   .    <cr        -uto)   >  1-c. 

nax  «jv  fX  max 


m 


Note :   Theorems  2  and  3  imply,  with  probability  1,  that  our  algorithms  never  deadlock,  no  process 
starves,  and  our  algorithm  is  probabilistically  fair,  in  the  sense  that  each  willing  user,  gets  its 
resources  infinitely  often  in  an  infinite  time  interval,  with  probability  1. 

k.    THE  PRIORITY  BIDDING  ALGORITHM 

'♦.1   Mot  i vat  i  on.   Theorem  1  provided  lower  bounds  for  systems  which  are  saturated  with  requests.   In 
practice,  systems  will  not  continuously  have  so  many  requests  at  all  times.   Ttds  section  provides  an 
algorithm  which  gives  good  response  time  in  the  case  in  which  the  system  is  not  saturated,  (though  it 
is  not  asymptotically  efficient  for  saturated  systems,  as  the  algorithm  of  Section  3). 

't.2   Description  of  the  Priority  Bidding  Algorithm. 

a.   ^OLXild  {cl    ilte.1     -c.   The  round  starts  with  the  user  waiting  for  a  randomly  chosen  n-imber  of 

•  teps ,  uniform  in  an  interval  upper  bounded  by  a  constant   c,  "  (2(2+u)+l)(r    /r  .  )    steps. 

1  majc      min 

Note   that      c,      is    chosen    in   such    a  way    that      c    -r  is   greater  than   the   maximum  possible    duration 

1  •'1      min 

of   the   useful  part   of   the   round.      The   rest   of   the    round  is    the   same   as   in  our  uniform  bidd.ing 

algorithm. 

^-      ^ound  i^on   ReiiOuAce  AZLoccUon.      1.      Each  round  of  process     j      is  split  into  a  sequence  of 

'log   k'      intervals.      For   each      m-0,...,'log   k',    in   each   interval      A        only   the   users      i      for  which 

k   e(ik/2         J  , 'k/2    ')      are   monitored.      Process      j       proceeds    to   the   next   interval      A      ,      only    if    all    use 

nv+l  m  "^^ 

processes   which    demand      k.       resources,    k   € [ ik/2         j , 'k/2    ']       have   been   allocated   their    resources. 

Within  each     A^,    the   resource   allocator   goes   through   a   sequence   of    "small   rounds",    each   small   round 

being   exactly   as   a   round  of   a   resource   allocator   in  our  uniform  bidding   algorithm  of   Section   3. 

^•3      Probabilistic   Analysis    of    the    Priority   Bidding   Algorithm.      Let   us    consider  a   time    interval      A^- 
i^t      u'      be    the    number   of    rounds    required    for   user      1      with      k.eiik/2"'*    j  ,  ik/2°'' )  ,    and   also    for    all 
users    conpeting   with   user      i,    to  have    all    resources    allocated.      The    number   of   those    users    is 
s   k/2      •    v.      Let      u      be    the   number   of    rounds    (within      A    ) ,    required   just    for   user      1.      Let      u(E') 
be    such   that      Probiu  <u (C ' ) }    >    1  -  E ' .      Then,      Prob{u'<i 


11 

Independent  actions  and  due  to  the  random  waits.   Let  us  set  C    -  (cAv)-2  .   Then 

kv 


Prob 


y<"(^)  >('-^)'"- 


leading   to   a  mean   response   of      01 —  logl  — J]     for   the    interval     A_.      This    determines  probabilistic 
upper  bounds   on   the    length   of   the   interval     A    .      Since   there   are    'log   )c '    such   intervals,    the 

,BH-1 


But,    from   the    analysis   of   the   uniform  bidding    algorithm    (Theorem   3)    and   fra«   the    fact    that,   within 
each     A    ,    the   algorithm   looks   exactly   liXe   the   uniform  bidding   algorithm,  ••  9«t 

interval     A 
e-response   for   a   user  with       K^  €  (  ik/2°^    ,)c/2"'n      will  be 

0  I  —  log  I  I  log   k I         implying   a  mean   response    of        O  (  — 

Va"  Ve-2'"/  /  \2'" 

For  users  with  small  demands (i.e. ,  when  k/?  «-o(k))  the  above  mean  response  is  better  than  the  mean 
response  of  the  uniform  bidding  algorithm  of  Section  3.  E.g.,  for  2°'  -  9(k)  we  get  a  mean  response 
0(v  logk  logv).      We   hence   conclude: 

THEOREM  k.      Our  priority  bidding  algorithm  has   the  follauing  property:     For  users     i     with  demcmdB 


o(h^v   logk   log  (Y)) 


0(h   V   log   k   log(h   v)  ) 
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